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ABSTRACT 

Spontaneous lytic reactivation of Kaposi's sarcoma- 
associated herpesvirus (KSHV) occurs at a low rate in 
latently infected cells in disease and culture. This 
suggests imperfect epigenetic maintenance of viral 
transcription programs, perhaps due to variability in 
chromatin structure at specific loci across the popu- 
lation of KSHV episomal genomes. To characterize 
this locus-specific chromatin structural diversity, 
we used MAPit single-molecule footprinting, which 
simultaneously maps endogenous CG methylation 
and accessibility to M.CviPl at GC sites. Diverse 
chromatin structures were detected at the LANA, 
RTA and vlL6 promoters. At each locus, chromatin 
ranged from fully closed to fully open across the 
population. This diversity has not previously been 
reported in a virus. Phorbol ester and RTA transgene 
induction were used to identify chromatin conform- 
ations associated with reactivation of lytic transcrip- 
tion, which only a fraction of episomes had. 
Moreover, certain chromatin conformations corre- 
lated with CG methylation patterns at the RTA and 
vlL6 promoters. This indicated that some of the 
diverse chromatin conformations at these loci were 
epigenetically distinct. Finally, by comparing chroma- 
tin structures from a cell line infected with constitu- 
tively latent virus, we identified products of lytic 
replication. Our findings show that epigenetic drift 
can restrict viral propagation by chromatin compac- 
tion at latent and lytic promoters. 



INTRODUCTION 

Kaposi's sarcoma-associated herpesvirus (KSHV) infec- 
tion causes sarcoma and lymphoproliferative disorders 
in immunocompromised individuals [reviewed in (1)]. In 
the latent phase, KSHV genomes reside within the nucleus 
as multicopy circular extrachromosomal elements or 
episomes. Both micrococcal nuclease cleavage patterns 
and immunoprecipitation of histone proteins demonstrate 
that episomal DNA is incorporated into nucleosomes 
(2-4). Lytic reactivation, thought to be triggered by 
multiple cellular stress pathways in vivo, is associated 
with changes in chromatin structure (3,4). Furthermore, 
histone deacetylase and DNA methyltransferase (DNMT) 
inhibitors can reactivate the lytic cycle in latently infected 
cell culture, as can phorbol esters, e.g. 12-(9-tetra- 
decanoylphorbol-13-acetate (TP A), which are protein 
kinase C agonists [reviewed in (5)]. Thus, episomal chro- 
matin contributes to maintenance of the latent state and 
the repression of lytic transcription. 

Chromatin intrinsically regulates gene expression, as 
many transcription factors exhibit reduced access to 
binding sites incorporated into nucleosomes [reviewed in 

(6) ]. Furthermore, chromatin components often have 
post-synthesis chemical modifications (DNA methylation 
and acetylation, methylation, etc. of histones) that can 
activate or repress transcription locally, as well as recruit 
their own modifying enzymes. Chromatin modifications 
can thus constitute a self-perpetuating state. Such states 
are termed epigenetic if maintained throughout the cell 
cycle, as in the case of DNA methylation [reviewed in 

(7) ]. Epigenetics underpin the differentiation of isogenic 
cells during development, as well as inactivation of one 
of two X chromosomes in female mammals. Imperfect 
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maintenance of chromatin states, i.e. epigenetic drift, may 
contribute to aging and cancer [reviewed in (8)]. 

Epigenetic drift could be important for viral biology 
as well. DNA methylation gradually builds over much of 
the KSHV genome after de novo infection (3), indicating 
that epigenetic changes occur. However, the phenotypic 
consequences of this drift are unknown. Generally, DNA 
methylation is linked to transcriptional silencing [reviewed 
in (9)], suggesting that it could inactivate the virus. On the 
other hand, it might protect against the host innate immune 
response to unmethylated DNA (10). 

Recently, it has become possible to characterize hetero- 
geneity of chromatin structure at select loci within popu- 
lations of mammalian nuclei by treatment with 
recombinant M.CviPI DNMT [reviewed in (11-13)]. 
This enzyme, which we cloned from Chlorella virus, 
methylates cytosine in GC dinucleo tides to G-m 5 C (14). 
The methyl marks can be read by bisulfite genomic 
sequencing [BGS; (15)] and unequivocally distinguished 
from endogenous mammalian methylation at H-m 5 CG 
(by convention, H equals A, C, or T). As nucleosomes 
and DNA-bound factors block access of exogenous 
DNMT probes to DNA (16-19), the resultant GC methy- 
lation pattern is used to infer chromatin structure. The 
combination of high-resolution probing with M.CviPI 
and BGS is termed MAPit, for methylation accessibility 
probing for individual templates, developed by our labora- 
tory (20). By sequencing clonally expanded individual 
molecules, MAPit allows detection of multiple chromatin 
states at selected loci within a cell population. This is in 
contrast to techniques such as chromatin immunopre- 
cipitation, which report the population average 
[reviewed in (11)]. Although new, MAPit is rapidly being 
adapted to the study of diverse genomic loci (20-26). 
Related studies support the ability of MAPit to accurately 
map gaps between nucleosomes across the genome (27) 
and track heterogeneity of chromatin structure (28,29). 

Hypothesizing that KSHV populations would exhibit 
epigenetic drift, we examined promoters and 5' ends of 
genes of interest in latently infected cells by MAPit. The 
degree of diversity seen was surprising. Both fully closed 
and fully open conformations were present at three differ- 
ent KSHV promoters. Treatment with TP A and induced 
expression of the replication and transcription activator 
(RTA) were used to test which of the diverse chromatin 
structures found were linked to reactivation during lytic 
transcription. At the time point tested, only a subset of 
episomes appeared to have remodeled chromatin in 
response to lytic reactivation by both means. We also 
observed a subset of episomes associated with lytic repli- 
cation, and identified a link between chromatin compac- 
tion, CG methylation and inhibition of reactivation. These 
findings suggest that latent virus is subject to progressive 
epigenetic inactivation. 

MATERIALS AND METHODS 

Cell culture 

Body-cavity-based lymphoma cell line 1 (BCBL1) cells 
(30) were cultured in RPMI 1640 medium supplemented 



with 10% (v/v) fetal calf serum (FCS), 2mM glutamine, 
1 mM sodium pyruvate, 100 U/ml penicillin and 
lOOug/ml streptomycin at 37°C under 5% C0 2 . TREx 
BCBL1-RTA cells for doxycycline-induced expression 
of RTA (31) were cultured in RPMI 1640 medium 
supplemented with 10% (v/v) FCS or tetracycline-free 
FCS, 2mM glutamine, 1 mM sodium pyruvate, 
100 U/ml penicillin and 100|ig/ml streptomycin at 37°C 
under 5% C0 2 . Telomerase-immortalized vein endothe- 
lial long-term-infected (TIVE-LTC) cells (32) were 
cultured in Dulbecco's modified Eagle's medium 
(DMEM) supplemented with 10% (v/v) FCS, 4.5 g/1 
glucose, 2mM glutamine, 1 mM sodium pyruvate, 100 
U/ml penicillin and 100 (ig/ml streptomycin at 37°C 
under 5% C0 2 . 

For TPA-induced reactivation, BCBL1 cells were grown 
to a density of 2-3 x 10 5 cells/ml. TPA was added to a 
final concentration of 20ng/ml and cells were incubated 
at standard conditions (37°C, 5% C0 2 ) for 6h. Cells were 
pelleted at 2Alg for 5min and resuspended in the same 
volume of fresh RPMI medium and grown for another 
6h before nuclei isolation for MAPit. For reactivation 
by doxycycline-inducible RTA, TREx BCBL1-RTA cells 
were cultured for at least 2 days in tetracycline-free 
medium before performing the induction. Cells at a 
density of 4-5 x 10 5 per milliliter were then treated with 
1 (ig/ml doxycycline and 250 jiM foscarnet for 6 h. 
Foscarnet was used to suppress viral lytic replication; 
quantification of viral load by real-time polymerase 
chain reaction (PCR) indicated that viral replication was 
largely blocked (not shown). Cells were pelleted at 2Alg 
for 5min and resuspended in the same volume of fresh 
RPMI medium with 250 (iM foscarnet and grown for 
another 6h before nuclei isolation for MAPit. 
Uninduced BCBL1 or TREx BCBL1-RTA cells (treated 
with 250 uM foscarnet), respectively, served as the 0 h time 
point. 



MAPit single-molecule footprinting 

MAPit was performed as described previously (12,13). 
Briefly, BCBL1 or TREx BCBL1-RTA cells were har- 
vested at a density of 3-6 x 10 5 cells/ml, washed twice 
with ice-cold phosphate-buffered saline (PBS), counted 
and resuspended at 10 6 cells/ml in PBS. Four to five 
million cells were stored as pellet at — 80° C for later 
RNA extraction (see below). For each MAPit assay, 10 6 
cells were pelleted and resuspended in 200 ul cold Buffer A 
[(10 mM 4-(2-hydroxyethyl)- 1 -piperazineethanesulfonic 
acid (HEPES)-KOH, pH 7.5, 2 mM MgCl 2 , 10 mM 
KC1, 2mM dithiothreitol (DTT), 0.2 mM phenylmetha- 
nesulfonyl fluoride (PMSF)]. Cells were incubated on ice 
for lOmin to release nuclei by hypotonic lysis, then 
vortexed 10 s at medium velocity and centrifuged for 10 s 
at 16 000g. To check for complete cellular lysis, an aliquot 
of nuclei in 0.2% (w/v) trypan blue were examined by light 
microscopy. Nuclei were only used for MAPit if the lysis 
was complete, and nuclei were evenly round and granular 
in appearance. Nuclear integrity was also confirmed in 
parallel by Hoechst staining, which dyes DNA, and 
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fluorescence microscopy. Nuclei were washed twice with 
500 |il cell resuspension buffer [20 mM HEPES-KOH, 
pH 7.5, 70 mM NaCl, 0.25 mM ethylenediaminetetraacetic 
acid (EDTA), 0.5 mM ethylene glycol tetraacetic acid 
(EGTA), 0.5% (v/v) glycerol, 10 mM DTT, 0.25 mM 
PMSF] and resuspended in 90 ul methylation buffer (cell 
resuspension buffer supplemented with 160 uM S- 
adenosyl-L-methionine) . 

TIVE-LTC cells were harvested at 100% confluency, 
washed twice with ice cold PBS, counted and resuspended 
at 10 6 cells/ml in PBS. Cells were pelleted (5 x 10 6 cells for 
each MAPit assay), washed once in 250 ul cell resuspen- 
sion buffer and lysed in 100 ul cell lysis buffer [cell resus- 
pension buffer plus 0.25% (v/v) Nonidet P-40] for lOmin 
on ice. Nuclei were pelleted at lOOOg for 5 min at 4°C, then 
washed twice with 500 ul cell resuspension buffer and re- 
suspended in 90 ul methylation buffer. 

To methylate accessible GC sites, 30 U recombinant 
M.CviPI-maltose binding protein (MBP) fusion [New 
England Biolabs, hereafter M.CviPI; (14)] were added 
per 10 6 nuclei (based on cells counted after all washes 
completed), which were then incubated at 37°C for 
15 min. Reactions were stopped by addition of an equal 
volume of 1% (w/v) sodium dodecyl sulfate (SDS), 
lOOmM NaCl, 10 mM EDTA and incubated overnight at 
50°C with 100 ug/ml proteinase K. DNA was then phenol 
extracted (partitioned with an equal volume phenohchloro- 
form:isoamyl alcohol::25:24:l) and ethanol precipitated. A 
0 U M.CviPI control reaction was also performed to gauge 
endogenous GC methylation (not shown; none was 
observed at any locus). 

To ensure comparable results between experiments, 
M.CviPI activity was calibrated between aliquots of 
enzyme using methylation of a Haelll site in the 
GRP78 promoter in BCBL1 cells as a standard. MAPit 
experiments were also conducted at 10 rather than 70 mM 
NaCl, with 3-fold higher and lower effective M.CviPI 
concentration, and over a 3-fold range in M.CviPI incu- 
bation time (Supplementary Figures SI and S2). A 
non-linear relationship was seen between M.CviPI con- 
centration and degree of methylation. This matches a 
simple model in which DNA molecules adopt multiple 
chromatin conformations that vary in accessibility to 
M.CviPI. 

Restriction (R) endonuclease R.Haelll probing of nuclei 

Direct treatment of nuclei with R.Haelll confirmed 
relative differences in accessibility between the LAN A 
promoter and genomic loci (Supplementary Figure S3). 
Briefly, 4 x 10 5 nuclei were prepared as above and sus- 
pended in 100 |il 20 mM HEPES-KOH, pH 7.5, 70mM 
NaCl, 10 mM MgCl 2 , 0.5% (v/v) glycerol, 10 mM DTT, 
0.25 mM PMSF, plus 0-100 U R.Haelll (New England 
Biolabs) or 50% (v/v) glycerol in 10 ul. After 30 min at 
37°C, reactions were stopped by addition of an equal 
volume 1% (w/v) SDS, 100 mM NaCl, 10 mM EDTA 
and incubation for 20 min at 80°C. Genomic DNA was 
prepared as described above and percent digest quantified 
by real-time PCR. 



Bisulfite genomic sequencing 

To deaminate, 2-4 ug DNA was denatured for 5 min at 
95°C in freshly made 0.3 N NaOH, then incubated for 6 h 
at 50° C in a saturated metabisulfite solution as previously 
described (33). Bisulfite-converted DNA was desulfonated 
and concentrated with a commercial kit (Zymo Research 
catalog no. D5026). Target loci were amplified from 20- 
100 ng bisulfite-treated DNA with 3 U HotStar Taq DNA 
polymerase (Qiagen) and 250 nM primers in lOmM 
tris(hydroxymethyl)aminomethane (Tris)-HCl, pH 8.6, 
50 mM KC1, 3 mM MgCl 2 and 200 uM of each 2'- 
deoxynucleoside 5'-triphosphates (dNTP). Primers se- 
quences are given in Supplementary Table SI. 
Amplification was performed in 2-3 separate reactions 
to minimize stochastic variation in PCR amplification, 
then pooled for cloning. Cycling conditions were 5 min 
at 95°C, followed by 40 cycles of 1 min at 95°C, 1 min at 
56°C-60°C, (optimized for each primer pair), 3 min at 
72°C, and a final extension of 5 min at 72°C. 

Amplified DNA was separated from unincorporated 
primers by electrophoresis in 1% (w/v) agarose buffered 
with Tris-acetate-EDTA, excised and purified from the gel 
and ligated into a cloning vector using topoisomerase 
(TOPO TA; Invitrogen). Calcium-competent DHlOb or 
TOP 10 cells were transformed with 10-50 ng plasmid 
and plated on lysogeny broth (LB) medium with 100 jig/ 
ml ampicillin and 40 ug/ml 5-bromo-4-chloro-3-indolyl-|3- 
D-galactopyranoside (X-gal) for blue/white screening. 
Colony PCR with Ml 3 forward and reverse primers was 
also used to test cloning efficiency. Colonies containing 
the insert were suspended in 100 ul LB plus 100 ug/ml 
ampicillin and grown at 37°C overnight in 96- well plates 
without shaking. Glycerol was then added to 10% (v/v) 
and the plates were incubated at — 80°C for at least 2h, 
whereupon plasmid was subjected to TempliPhi™ rolling 
circle amplification and automated BigDye sequencing 
(Applied Biosystems) at the University of Florida 
Interdisciplinary Center for Biotechnology Research 
(http : / /www . bio tech .ufl . edu) . 

Sequence analysis 

The genomic sequences corresponding to BGS amplicons 
are listed in Supplementary Table S2. Sequence files in 
ABI format were aligned in Sequencher T (Genecodes) 
and are available on request. MethylViewer [http://dna. 
leeds.ac.uk/methylviewer; (22)] was used to ascertain 
percent cytosine conversion and tabulate methylation 
status of each site in each sequence read (i.e. molecule). 
Molecules with <95% conversion of cytosines outside po- 
tential methylation sites were discarded. Molecules from 
the same initial DNA sample that had identical patterns of 
both methylation and unconverted cytosines were 
presumed to be duplicate clones, and all copies but one 
were then discarded. Molecules with either zero methyla- 
tion or 100% conversion of cytosines were not screened in 
this way, as there were insufficient grounds to infer dupli- 
cation. Duplicates made up 0-10% of molecules in each 
dataset. 

The tables of methylation status were used to generate 
digital methylation maps for both CG and GC methylation 
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for each molecule in Excel (Microsoft). Within the LAN A, 
RTA and vIL6 promoters, CG methylation was rare; thus, 
GCG sites were counted as GC sites to increase resolution. 
For the amplicon overlapping the LAN A open reading 
frame (ORF), GCG sites were excluded from the analysis. 
Digital methylation maps were clustered hierarchically 
using Cluster [http://rana.lbl.gov/EisenSoftware.htm; (34)], 
and printed as color maps with MapleTree (http:// 
mapletree.sourceforge.net). Dendrograms were split by 
eye into clusters of interest. Molecules in each cluster 
originating from each experiment were counted. 
Expectation values for the observed distributions were 
computed as two- tailed Fisher's exact tests. To generate 
the accessibility profile (Figure 2D), the fraction of 
sequence reads bearing accessible spans of 1-100 or 
101-400 bp was computed at each base pair position. 
The difference in accessibility after TPA treatment was 
then subtracted from that before treatment. Likewise, to 
generate occupancy profiles (Figures 3D, 5D and 8D), the 
fraction of sequence reads bearing footprints of 4-39, 
40-100 or 101-200 bp was computed at each base pair 
position. Footprints extending to the ends of the amplicons 
were not counted. To get the difference in occupancy, the 
fraction for one experimental condition was then sub- 
tracted from the fraction for another. 

RNA preparation for real-time reverse transcription 
PCR (RT-PCR) 

BCBL1 and TREx BCBL1-RTA cells were treated as 
described above in two independent experiments 
(including different stocks of cells). From 4 to 5 x 10 6 
induced and uninduced cells, RNA was extracted using 
RNA Bee (TEL-TEST, Inc.) according to the manufac- 
turer's instructions. RNA was DNase I treated [10 jag 
RNA, 1 |il RNasin® (Promega), 1 x DNase I buffer, 2 jil 
(4 U) RNase-free DNase I (New England Biolabs) in 50 ul 
total volume] for 25 min at 37°C and recovered by ethanol 
precipitation after M6h at — 20°C. An aliquot of 
DNase I-digested RNA was tested for RNA quality by 
agarose gel electrophoresis and the rest stored at — 80°C. 
RNA (1-3 ug) was reverse transcribed using oligo dT 
primer and Invitrogen Superscript III reverse transcriptase 
(RT) in a 20 ul reaction mixture as indicated by the manu- 
facturer (20 ul reaction mixture per 1 ug RNA; for more 
RNA, the assay was scaled up accordingly). A reverse- 
transcription reaction without RT enzyme served as 
control (no RT). 

Quantitative real-time PCR 

According to the MIQE guidelines for quantitative 
real-time PCR [qPCR; (35)], we refer to reference genes 
instead of housekeeping genes and Cq (quantification 
cycle) instead of Ct (threshold cycle). All primers 
(Supplementary Table SI) were purchased from 
Integrated DNA Technologies, Inc. and tested for their 
efficiency (between 90 and 110%) and specificity 
(melting curve analysis). 

Per 12 ul qPCR reaction, 0.5 ul complementary DNA 
was used. Primers were used at a final concentration of 
200 nM. Two-step qPCR was performed with the Fast 



SYBR Green Master Mix and a StepOnePlus real-time 
PCR instrument (both Applied Biosy stems). Cycle condi- 
tions were initially 95°C for 10 min, 40 cycles of 95°C for 
15 s, 60°C for 1 min, followed by melting curve analysis. 

For quantification of M.CviPI-mediated protection 
from R.Haelll digest, 1 ug undeaminated DNA was 
digested with 10 U R.Haelll, as per the enzyme manufac- 
turer's directions, or mock-treated with an equal volume 
of 50% (v/v) glycerol. For each sample, three wells with 
lOng DNA apiece were quantified by real-time PCR, as 
above. 

RESULTS 

Multiple chromatin conformations in the KSHV 
latency-associated region 

LANA, vFLIP, vCyclin, Kaposin and 12 pre-microRNAs 
are highly expressed off polycistronic transcripts arising 
from a single KSHV latency-associated region (KLAR) 
in tumor cells. During latency, host RNA polymerase II 
initiates transcription of KLAR from the upstream latent 
transcription start site (TSS; Figure IB). Additionally, a 
second downstream TSS is used during lytic reactivation 
(36). Three CCCTC-binding factor (CTCF) binding sites 
are located between these two TSSs and the LANA open 
reading frame (ORF73); their occupancy by the chromatin 
insulator CTCF (37) contributes to LANA expression, 
mediates chromosomal looping and plays a role in 
episomal maintenance (38,39). All KSHV-infected cells 
express LANA, which is required for replication and 
maintenance of viral episomes in latently infected cells 
[reviewed in (40)]. Yet, not all of the ~80 episomes 
present in latently infected lymphoma cells need express 
LANA. We hypothesized that a subset of episomes might 
provide LANA function for the rest, which could be 
epigenetically inactivated. 

As a single-molecule technique, MAPit provided a way 
to test this hypothesis directly. A key advantage of MAPit, 
in contrast to population average readouts, is the ability to 
discover distinct subpopulations without enrichment. 
Nuclei from the KSHV-infected pleural effusion 
lymphoma (PEL) cell line BCBL1 were incubated with 0 
or 30 U M.CviPI, following the protocol outlined in 
Figure 1A. We cloned and sequenced 10 bisulfite- 
converted molecules encompassing the LANA promoter 
from each sample. The patterns of CG and GC methyla- 
tion were drawn by Methyl Viewer (22), which represents 
methylated cytosines as ball-and-stick symbols 
(Figure IB). Efficiency of conversion was estimated as 
percent HCH (H is any base but G) conversion in each 
sequence read (i.e. cytosine outside potential methylation 
sites; blue vertical ticks mark unconverted HCH). As 
expected, without M.CviPI addition, methylation of 
GCH sites was not above the background given by incom- 
plete conversion. The M.CviPI-probed sample exhibited 
prominent accessibility above background between and 
adjacent to three known CTCF sites. Some molecules 
were also accessible near the latent TSS. 

Although MethylViewer is excellent for comparing dif- 
ferent methylation types on single molecules, such detailed 
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Figure 1. MAPit analysis of chromatin structural diversity. (A) MAPit workflow diagram, adapted from Darst et al. (13). Chromatin (e.g. micro- 
somes) modulates accessibility of DNA sites to soluble proteins, such as (i) transcription factors (represented by shaded rectangle) in live cells, or (ii) 
exogenous M.CviPI DNMT in nuclei. Bisulfite treatment (hi) records M.CviPI accessibility pattern in the DNA sequence by conversion of 
unmethylated cytosine to uracil (cyan). Subsequent amplification and cloning (iv) allows recovery of this information from individual molecules. 
Note that endogenous and probe methylation sites are distinguished by sequence specificity of the respective DNMTs (respectively, cyan and 
magenta). The DNA sequences obtained are used to generate maps (v) of DNA methylation and accessibility. (B) MAPit at the KSHV gene 
LANA detected endogenous methylation and sporadic accessibility of the latent promoter. MAPit was performed in nuclei of BCBL1 cells. Top: map 
of genetic elements within the MAPit amplicon: LANA ORF (ORF73), three CTCF sites (ovals), lytic and latent TSSs (bent arrows) and TATA 
boxes (triangles). Middle: MethylViewer (22) display of MAPit results from control nuclei probed as indicated with 0 U or 30 U M.CviPI. Blue 
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graphs are unwieldy for large-scale analysis. We therefore 
devised a three-color map scheme (Figure 1C), used 
throughout the rest of this work. Patches of accessibility 
to DNMT (M.CviPI and endogenous DNMTs) received 
bright color (yellow and red, respectively), and patches of 
protection were colored black. Gray color denoted 
borders between methylated and unmethylated patches. 
We used unsupervised hierarchical clustering to order 
molecules, facilitating pattern recognition. Color maps 
for both GC and CG methylation status were clustered 
together. The 16 GCG sites were disregarded in this 
analysis, leaving 52 sites for GC and 36 for CG. 

The data showed 'banded' CG methylation across 
almost all molecules in the LAN A ORF. By contrast, 
the 10 molecules varied in degree and pattern of 
M.CviPI accessibility. This could reflect variation in chro- 
matin structure between episomes. An alternate explan- 
ation would be a poor nuclear preparation, such that the 
M.CviPI probe did not enter all nuclei. However, this 
seems unlikely as nuclei appeared intact and homogenous 
by vital (trypan blue) and Hoechst staining (not shown). 
Moreover, all 10 clones of the LAN A amplicon exhibited 
GC methylation above background unconverted HCH 
(Figure 1C, compare lower panels in 0 versus 30 U 
M.CviPI). In addition, we performed BGS at select host 
loci from the same sample. Two representative loci are 
shown (Figure ID). At the more open locus, the 
promoter of GRP78, all 17 molecules sequenced had GC 
methylation. This locus was previously found to be highly 
accessible to a DNMT probe in a colorectal line (28,29). 

Because MAPit requires amplification and cloning of 
bisulfite-converted DNA, it can be affected by sequence- 
specific biases in either step. We therefore measured 
M.CviPI methylation by R.Haelll digestion of undea- 
minated DNA from the same samples used for MAPit 
(Figure IE and Supplementary Figure S2). As R.Haelll 
does not digest its site GGCC when the central cytosine is 
methylated, the concordance between R.Haelll protection 
and BGS showed the latter to be unbiased. In turn, this 
meant that M.CviPI efficiently permeated nuclei and 
accessed the GRP78 promoter. We also probed nuclear 
DNA directly with R.Haelll, without prior M.CviPI 
treatment (Supplementary Figure S3). The restriction 
enzyme, which binds DNA cooperatively as a dimer and 
thus is similar in size to the monomeric M.CviPI fusion 
protein, respectively, 74.2 and 83.7 kDa, was able to access 



and cut its sites on most molecules. Thus, variation at 
LAN A was not due to uneven probing of nuclei. 

As indicated by the second locus shown, the GAPDH 
promoter, heterogeneity of promoter chromatin structure 
was not unique to LAN A. Variation in chromatin state 
between copies of the same gene can arise from several 
causes. Epigenetic variation occurs in X inactivation, im- 
printing and monoallelic expression. Dynamic variation 
occurs in periodic and sporadic cycles of chromatin re- 
modeling and transcription [for instance, (41)]. However, 
none of these phenomena have been described for a viral 
gene such as LAN A. In any case, the implication would be 
that the promoters in the closed conformation do not 
support transcription. Thus, as hypothesized, not all 
episomes continuously express LAN A. Because all 
infected cells express LAN A (32), this result implies het- 
erogeneity of episomes within individual nuclei. 

MAPit survey of LANA promoter architecture 

In addition to the latent TSS, the LANA promoter uses a 
lytic TSS on reactivation (36). Furthermore, transcription 
starting at the latent TSS overlaps the lytic K14 promoter 
on the opposite strand. We asked whether MAPit could 
visualize changes in chromatin structure within this 
complex regulatory region during lytic reactivation 
induced by phorbol ester treatment. To test this, we 
cloned and sequenced 102 amplified deaminated LANA 
promoter molecules from M.CviPI- treated BCBL1 nuclei 
(Oh or uninduced), and 110 molecules from 
M.CviPI-treated nuclei harvested after 12 h of TP A treat- 
ment of cells (12 h or induced). Induced transcription of 
two early lytic genes, RTA (ORF50) and vIL6 (ORF K2), 
was demonstrated by real-time RT-PCR (Supplementary 
Figure S4). An early time point was chosen to obtain in- 
complete lytic reactivation. Otherwise, histone-depleted 
products of viral DNA replication would out-compete 
cloning of BGS products derived from chromatinized 
viral episomes (4,42). Hence, these induction conditions 
did not have an obvious effect on M.CviPI accessibility 
averaged across the population (Figure 2A and B). 

We therefore asked whether differences might be 
detected in a single-molecule analysis. To determine 
which methylation patterns were enriched in one dataset 
relative to the other, we pooled all reads from the 0 - and 
12-h TPA treatment time points and clustered them hier- 
archically as GC methylation plots (Figure 2C). We 



Figure 1. Continued 

vertical ticks indicate positions of unconverted cytosines, excluding GC and CG sites (i.e. HCH); percent conversion of all HCH cytosines for each 
sequence clone is given on the right. Bottom (key): Circles and triangles, respectively, indicate CG and GC sites. Methylated sites are shaded. 
Methylated GCG sites are shaded gray to indicate their ambiguity; however, comparison of sequences from nuclei with and without M.CviPI 
treatment (middle, lower and upper panels) indicated that GCG sites within the LANA ORF were endogenously methylated. (C) Condensed view of 
the data in B, to scale with locus map at top. Each row of pixels represents one cloned sequence, i.e. molecule. Each plot tracks only GC or CG 
methylation; GCG sites are ignored. As indicated by key at bottom, spans of color mark >2 contiguous methylated sites; black marks >2 contiguous 
unmethylated sites; gray marks spans between methylated and unmethylated sites; white indicates missing or unaligned sequence. Positions of sites 
are indicated by circles and triangles (HCG and GCH, respectively). (D) Host promoters varied in degree of accessibility to M.CviPI. Maps of GC 
methylation at two representative promoters are shown, to scale with LANA amplicon. (E) MAPit at the three loci shown was semi-quantitative. At 
the Haelll sites indicated in red at GAPDH, GRP78 and LANA, 10 of 20, 15 of 17 and 1 of 10 molecules had a methylated GC site, respectively 
(blue bars). R.Haelll digestion followed by quantitative real-time PCR was used to measure bulk methylation of the same three sites in genomic 
DNA purified from M.CviPI-treated nuclei (red bars). Under these conditions, ~95% of DNA from untreated nuclei (i.e. unmethylated) was digested 
(not shown). The concordance between BGS and R.Haelll digest indicated that representation of methylated and unmethylated molecules in MAPit 
was unbiased. 
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plotted accessibility to M.CviPI in each molecule as blue 
(Oh) or yellow (12 h) to mark its source and split the den- 
drogram into clusters of interest. To better visualize 
differences in chromatin structure between clusters, each 
was rescaled vertically. The fraction of molecules present 
in each cluster from each time point is given on the right. 
The KLAR amplicon in this experiment was positioned 
farther upstream from the LAN A gene than that 
analyzed in Figure 1, allowing better characterization of 
accessibility at the LAN A latent TSS. Strikingly, the three 
CTCF sites were seen to be occupied by footprints in 
almost all molecules. Even in cluster i, which had 
reduced M.CviPI methylation overall, the three CTCF 
sites were dimly outlined. Importantly, this demonstrates 
that M.CviPI permeated these nuclei, but was preferen- 
tially excluded from the latent TSS (especially compared 
with clusters Hi and iv). 

For each cluster, we found the probability P of having 
so skewed a distribution of molecules from the two 
samples by chance. Although clustering revealed several 
distinct chromatin conformations, TPA treatment 
increased representation of molecules only in cluster Hi 
(by 13%) with borderline significance (P = 0.05). To 
further examine TPA-induced changes in LAN A 
promoter chromatin structure, we calculated accessibility 
of lengths 1-100 and 101-400 bp occurring at each 
position, and subtracted accessibility before TPA treat- 
ment (Oh) from that after (12 h) (Figure 2D). TPA treat- 
ment decreased accessibility adjacent to the CTCF sites 
(1-100 bp, red curve) and just downstream of the LAN A 
latent TSS (101-400 bp, blue curve) in -10-15% of the 
molecules, corresponding well with the number of mol- 
ecules TPA increased in cluster Hi. We conclude that 
TPA increased protection at the K14 TATA element, 
mainly in cluster Hi. 

Clustering also highlighted the large degree of diversity 
of chromatin structure at this locus. Both fully protected 
and almost fully accessible molecules were detected. 
Cluster ix was of particular interest as it appeared to be 
nearly nucleosome free. Complete chromatin unfolding 
and nucleosome displacement across a region of this size 
has not been observed at any human chromosomal locus 
studied by MAPit. We hypothesize that molecules of 
cluster ix originated from spontaneous lytic reactivation 
of KSHV, which occurs within 1-3% of PEL cells 
(30,43,44). As is the case for other herpesviruses, the 
KSHV genome is depleted for histones during lytic repli- 
cation before being packaged into virions (4,42). These 
results indicated that MAPit could discover chromatin 
conformations comprising <~5% of a complex 
population. 

A subset of episomes undergoes changes in chromatin 
structure during reactivation 

The master regulator of lytic transcription is the replica- 
tion and transcription activator RTA (45). RTA directly 
activates several downstream viral genes, such as vIL6, as 
well as transcription of its own gene (46). We hypothesized 
that a subset of latent episomes would be poised to tran- 
scribe RTA (ORF50) and that these would have a distinct 



chromatin structure visible by MAPit. Induced reactiva- 
tion would be predicted to shift these molecules to the 
actively transcribing state, which might involve nucleo- 
some repositioning. We therefore cloned and sequenced 
114 and 112 molecules from M.CviPI-treated nuclei har- 
vested after 0- or 12-h TPA treatment, respectively 
(Figure 3). 

Accessibility to CG and GC DNMTs was computed at 
each position. Within ORF48 (5' of RTA), a peak of CG 
methylation was observed upstream (3 13-284 bp upstream 
of the RTA TSS) of a positioned nucleosome, hereafter 
N— 1 (Figure 3 A and B). The same position had been 
previously shown to be endogenously methylated in 
BCBL1, although in a larger proportion of the population 
(53). A second notable feature of this plot was a broad 
peak in GC accessibility centered ~ 1 00-bp upstream of the 
RTA TSS. An open chromatin structure was not expected 
to be so common in the latent state. Moreover, there was 
no obvious difference between GC accessibility profiles 
before and after TPA treatment (Figure 3A). Thus, the 
bulk of the population was unchanged. As expected, the 
GAPDH (Supplementary Figure S2C) and GRP78 control 
loci also showed no change in chromatin architecture in 
response to TPA (compare Supplementary Figure S5 with 
Figure ID, upper). 

As before, molecules from the two time points were 
clustered together to identify chromatin conformations 
enriched in either (Figure 3C). Cluster i, comprising 
~50% of all molecules, was, as at KLAR, largely (but 
not completely) inaccessible to M.CviPI and presumably 
transcriptionally inactive. Clusters ii-v all had an N — 1 
footprint. Clusters ii and Hi appeared similar to the eye: 
both featured three evenly spaced positioned 
nucleosome-sized footprints. However, cluster Hi was 
more closely related to cluster v than ii in the original 
dendrogram (Supplementary Figure S6). The central 
N + 1 nucleosome in cluster ii covered several previously 
identified transcription factor binding sites upstream of 
the RTA TSS. The N + 1 and N + 2 nucleosomes seemed 
to shift slightly downstream in cluster Hi versus ii. Cluster 
iv had a variably positioned N — 1 footprint, followed 
downstream by a large open gap, then 2-3 variably pos- 
itioned nucleosome-sized footprints. Cluster v molecules 
shared a large M.CviPI-accessible linker upstream of 
N — 1 , with chromatin that appeared to be denser down- 
stream over the RTA transcribed region. Cluster vi lacked 
N — 1 . Lastly, cluster vii was, just as cluster v found at 
KLAR, almost entirely accessible and again comprised 
~3% of all molecules. 

Clusters ii, v and vi had P values below the usual thresh- 
old for significance (<0.05). Because clusters H and vi 
decreased in response to TPA, they may represent tran- 
scriptionally poised states, primed to respond to activating 
stimuli. Cluster iv would then correspond to a transcrip- 
tionally active state or a thermodynamically stable stage in 
the cycle of RNA polymerase recruitment and release. 
Although it had too few molecules for statistical signifi- 
cance, cluster Hi was also enriched for molecules from 
TPA-treated cells and might also represent a stage in the 
transcription cycle. Comparing the increase in cluster v to 
the decrease in clusters ii and vi, we conclude that at least 
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Figure 2. TPA treatment altered KLAR chromatin structure in ~10% 
of BCBL1 episomes. Note that this KLAR amplicon, although 
overlapping the amplicon of Figure 1, was centered upstream to 
better observe accessible DNA at the latent TSS. (A) Profile of en- 
dogenous CG (0-h TPA treated, red; 12-h TPA treated, green) and 
exogenous GC methylation (0-h TPA treated, black; 12 h TPA 
treated, blue) across the locus. GCG sites were not counted. 
(B) Schematic of the analyzed amplicon. The LANA ORF begins left 
of the amplicon. The LANA latent and lytic TSSs are indicated. TATA 
boxes are indicated by triangles (upside down, latent and lytic LANA; 
righted, ORF K14). (C) All GC methylation plots for 0- and 12-h TPA 
treatment (blue and yellow, respectively), including GCG, as m 5 CG was 
rare in (A), were organized by unsupervised hierarchical clustering. GC 
sites are indicated above; plot is to scale with other panels. 
Dendrogram was split into nine fractions of interest. On right, propor- 
tion of sequence reads from each data set present in each cluster 
(columns do not sum to 100% owing to rounding). Expectation value 
P is the chance that as uneven a distribution of molecules from the two 
datasets would occur if molecules had been assigned to the cluster at 
random. (D) Difference in accessibility from 0- to 12-h of TPA treat- 
ment. Cumulative accessibility in two size ranges, 1-101 and 101— 
400 bp, was tallied at every position in each dataset, then the 0-h 
profile was subtracted from the 12-h profile. 
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Figure 3. Induced lytic reactivation altered chromatin state frequency 
at RTA. (A) Profile of average endogenous CG (0-h TPA treatment, 
red; 12-h TPA treatment, green) and exogenous GC methylation (0-h 
TPA treatment, black; 12-h TPA treatment, blue) across the locus. 
GCG sites were not counted. Note locus of highest CG methylation 
coincided with a peak of GC methylation; elsewhere, CG methylation 
was minimal. (B) Diagram of RTA locus amplified. Triangles mark 
some of the known or suggested transcription factor-binding sites: 
C/EBP (C) (47), OCT-1 (O) (48), YY1 (Y) (49), AP-1 (A) (50), Spl/3 
(S) (2), XBP-1 (X) (51) and TATA (T). A second RTA TSS was 
recently reported roughly 500-bp upstream of this amplicon (52). 
Below, inferred nucleosome positions are indicated as two overlapping 
gray ovals, each 150-bp long. (C) Clustered GC methylation maps from 
0-h (blue) and 12-h (yellow) treatment with TPA. Methylation of GC 
sites (including GCG, as m 5 CG was rare) was plotted as in Figure 1, to 
scale with panels (A, B and D). Position of GC sites is indicated by 
hashes at top. Maps from both data sets were sorted together by un- 
supervised, hierarchical clustering. Seven clusters of interest are pre- 
sented. On right are given the proportion of sequence reads from 
each data set present in each cluster, and estimate of expectation 
value P assigned as in Figure 2. Columns do not sum to 100% due 
to rounding. (D) Difference in occupancy from 0 to 12 h of TPA treat- 
ment. Footprints in two size ranges, 4-39 and 121-200 bp, were 
counted at every position in each dataset, then the 0-h profile was 
subtracted from the 12-h profile. An intermediate size range, 
40-120 bp, showed no difference between the data sets. 
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10% of episomes remodeled RTA chromatin in response 
to TPA. By the same token, a large fraction of molecules 
appeared not to have responded to TPA at this time point, 
including those of cluster i, which were highly compacted. 

Several small footprints covering 1-2 GC sites occurred 
at 93, 86 and 74 bp upstream of the TSS (Supplementary 
Figure S6), which might represent some of the transcrip- 
tion factors reported to bind this area. To determine which 
footprints were more abundant in each dataset, we 
counted footprints of lengths 4-39 and 121-200 bp 
occurring at each position and subtracted footprint 
counts before TPA treatment from those after 
(Figure 3D). The result is difference in occupancy of 
factors within each footprint size range at each base pair 
position. This analysis revealed greater definition of the 
N — 1 footprint (broad peak in black spanning from 
—300 to —100 bp relative to the RTA TSS), repositioning 
of nucleosomes inside the transcribed region of RTA 
(sharp peaks on right), and changes in some 
sub-nucleosomal footprints flanking these areas (in red), 
but not changes in the putative transcription factor foot- 
prints. Nucleosome repositioning in the gene was consist- 
ent with the hypothesized shift from cluster ii to Hi. 

Together, these results suggested that only a subset of 
KSHV episomes had responded to TPA by reorganizing 
RTA promoter chromatin by 12 h of TPA induction. 
From this, one would predict a similar proportion of 



episomes to be expressing a downstream target of RTA, 
an immediate-early gene such as vIL6. We cloned 96 and 
71 molecules amplified from bisulfite-converted DNA 
isolated from TPA-treated and untreated cells, respect- 
ively. Unlike at RTA, at the vIL6 promoter (within 
ORF2), there was a wide peak of accessibility to 
M.CviPI (Figure 4 A and B). As with RTA, no change in 
bulk accessibility was seen on TPA treatment from the 
methylation plots alone. However, clustering of molecules 
once again showed otherwise. As at KLAR and RTA, 
there was both an abundant, highly inaccessible, likely 
silenced cluster (Figure 4C, cluster i) and a rare, highly 
accessible cluster (v), suggesting that the underlying chro- 
matin behavior was episome wide. Again, neither of these 
two clusters changed significantly on TPA treatment. 
However, cluster iv, lacking two or more nucleosomes 
within the vIL6 promoter, did increase significantly from 
8 to 24% of molecules after induction (P = 0.01). This 
suggests chromatin remodeling in response to activation 
by RTA. Moreover, these numbers were consistent with 
the estimate that ~10% of episomes were transcribing 
RTA at this time point. The observation that 8% of mol- 
ecules showed the same vIL6 chromatin accessibility 
during latency is in congruence with a recent report of 
low levels of vIL6 expression during latency (54). As at 
RTA, results were consistent with a model of heteroge- 
neous reactivation of the lytic pathway, in which an 
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Figure 4. TPA treatment altered chromatin structure at the vIL6 gene in 10-15% of BCBL1 episomes. (A) Profile of average endogenous CG (0-h 
TPA treatment, red; 12-h TPA treatment, green) and exogenous GC methylation (0-h TPA treatment, black; 12-h TPA treatment, blue) across the 
locus. (B) Diagram of vIL6 locus amplified. Little has been reported of the structure of the vIL6 promoter, except for the two shown TSSs (bent 
arrows) (74). (C) All GC methylation plots for 0- and 12-h TPA treatment (blue and yellow, respectively, including GCG) were organized by 
unsupervised hierarchical clustering. GC sites are indicated above; plot is to scale with other panels. Dendrogram was split into five fractions of 
interest. On right, proportion of sequence reads from each data set present in each cluster, and expectation value P calculated as in Figure 2. 
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open chromatin structure such as in Figure 3C, cluster vi is 
prerequisite. We conclude that chromatin structure reflects 
ability of episomes to respond to inducers of lytic 
transcription. 

As a protein kinase C agonist, TPA induces weak sig- 
naling that may resemble a pathway for spontaneous 
KSHV reactivation in vivo. To further examine the 
extent to which the fraction of KSHV episomes with rela- 
tively inaccessible chromatin (i.e. in cluster i) was resistant 
to lytic reactivation, we performed an experiment with 
TREx BCBL1-RTA cells. These cells, containing an 
integrated doxycycline-inducible RTA transgene, over- 
express RTA after 12 h of treatment with doxycycline 
and hence undergo efficient reactivation (31,55). We col- 
lected cells after 12 h of induction and performed MAPit, 
using primers specific to the wild-type KSHV RTA 
promoter. We cloned 51 and 45 molecules amplified 
from bisulfite-converted DNA isolated from doxycycline- 
treated and untreated cells, respectively (Figure 5). Fold 
induction of lytic genes was 30- and 6-fold higher for RTA 
and vIL6 transcripts, respectively, in doxycycline-treated 
TREx BCBL1-RTA cells than had been observed in 
parental BCBL1 cells treated with TPA (Supplementary 
Figure S4). 

Several differences in RTA promoter chromatin archi- 
tecture were also observed between the cell lines. In the 
TREx BCBL1-RTA cell line, the upstream peak of CG 
methylation was absent (compare Figures 5 A and 3B, red 
and green curves), and a footprint spanned the two GC 
sites within the Spl binding site in 10-20% of molecules 
[Figure 5B and C, under triangle labeled 'S'; (2)]. 
Although this footprint was slightly more common on 
RTA induction, the increase did not pass the threshold 
for significance (P = 0.4). 

As before, we noted a subset of episomes with highly 
accessible chromatin, cluster v. These were more abundant 
than we had observed in the parent cell line BCBL1, 
possibly due to higher basal expression of RTA in the 
untreated TREx BCBL1-RTA cells. Consistent with this, 
5.6-fold more RTA transcript was observed in uninduced 
(tetracycline-free FCS) TREx BCBL1-RTA versus 
BCBL1 cells (not shown). 

Three clusters of molecules with intermediate accessibil- 
ity to M.CviPI were found, ii-iv (Figure 5C). As seen in 
the presence of phorbol ester, induction of RTA in trans 
caused a significant shift in abundance between two 
clusters (ii and Hi). However, in this case, the fraction of 
molecules affected was larger (30%, i.e. increase in cluster 
Hi versus decrease in clusters i and ii), and the 
doxycycline-induced state (cluster Hi) was more accessible 
to M.CviPI than had been seen with TPA induction 
(Figure 3, cluster iv). Displacement of N — 1 occurred in 
20% of molecules (Figure 5D), consistent with the shift of 
at least 20% of molecules to cluster Hi. Increased chroma- 
tin remodeling in TREx BCBL1-RTA is consistent with 
the higher and more persistent levels of RTA expression 
than are observed with TPA-induced reactivation 
(31,56,57). Alternatively, dissimilarities between the two 
experimental systems may reflect differences in transcrip- 
tion factor and/or co-activator recruitment. TPA brings 



AP-1 to the promoter, but direct RTA induction acts 
through C/EBP (47,50). 

As in the BCBL1 cells assayed in Figure 3, a large 
fraction of cloned sequences showed promoters with 
compact chromatin, inaccessible to M.CviPI (Figure 5C, 
cluster i). The fraction of KSHV episomes with closed 
chromatin did not significantly decrease in response to 
RTA induction in trans despite 30-fold increased RTA ex- 
pression as compared with TPA induction. Thus, MAPit 
indicated heterogeneous reactivation of RTA in highly 
induced TREx BCBL1-RTA cells, just as in TPA-treated 
BCBL1 cells. 
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Figure 5. Expression of a doxycycline-inducible RTA transgene altered 
chromatin structure at the endogenous RTA promoter of 30^0% of 
episomes. (A) Profile of endogenous CG (uninduced, red; 12 h of doxy- 
cycline, green) and exogenous GC methylation (uninduced, black; 12 h 
of doxycycline, blue). GCG sites were not counted. (B) Schematic of 
the RTA locus, as in Figure 3B. (C) GC methylation plots for 0- and 
12-h RTA induction (blue and yellow, respectively). GC sites are 
indicated above; plot is to scale with other panels. Dendrogram was 
split into five fractions of interest. On right, proportion of sequence 
reads from each data set present in each cluster, and expectation value 
P calculated, as in Figure 2. (D) Difference in footprint occupancy 
from 0 to 12 h of TPA treatment. Footprints in two size ranges, 4-39 
and 121-200 bp, were counted at every position in each dataset, then 
the 0-h profile was subtracted from the 12-h profile. 
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Epigenetic diversity in latent KSHV 

Our results suggested that the promoters of LAN A, RTA 
and vIL6 existed in at least two states across the popula- 
tion of episomes. The first had compact chromatin and 
was non-responsive for lytic reactivation, and the second 
exhibited various open conformations. However, it was 
not yet clear whether these states were epigenetic or 
reflected periodic or sporadic cycles of chromatin remodel- 
ing and transcription. Heritability of chromatin conform- 
ations identified by MAPit can be inferred by correlations 
between GC methylation and CG methylation, which is 
epigenetic. Indeed, a patch of endogenous CG methyla- 
tion occurred upstream of the RTA promoter in 25% of 
BCBL1 molecules sequenced (Figures 3 A and 6A and B). 
Note that clusters shown in Figure 6B were based solely 
on HCG methylation, i.e. excluding all GCG sites. Thus, 
they do not correspond to the GCH methylation clusters 
in Figure 3C. To test whether the latter were epigenetically 
distinct, we counted molecules with upstream m 5 CG 
patches in each M.CviPI methylation-defined cluster 
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Figure 6. Correlations between CG methylation and chromatin struc- 
ture observed by MAPit. (A) Schematic of the analyzed amplicon. 
Symbols are as defined in Figure 3B. (B) CG methylation plot 
indicating localized patch of CG methylation upstream of RTA TSS. 
These are CG methylation plots of the same molecules as Figure 3C, 
0-h treatment (not counting GCG). Unsupervised hierarchical cluster- 
ing revealed that ~25% of molecules shared a patch of CG methylation 
(red) ~300bp upstream of the RTA TSS. Positions of CG sites are 
indicated by hashes at top. (C) Link between CG methylation and 
MAPit structure at RTA. All molecules with 2-3 methylated CG sites 
within the linker were counted for each MAPit cluster defined in 
Figure 2C, as a percent of the total molecules in the cluster for each 
time point. Clusters ii-vi were grouped because some had as few as 
three molecules from a time point, increasing variance observed. 
Asterisk indicates data used to calculate P<0.01 that constant CG 
methylation between clusters could generate observed distribution. 



(Figure 6C). This feature was defined as methylation of 
2-3 of the CG sites 284-313 bp upstream of the RTA TSS. 

From both TPA-treated and untreated cells, 30% of 
cluster i molecules in Figure 3C had upstream m 5 CG 
patches. However, only 15% of molecules from clusters 
ii-vi and 0% from cluster vii had upstream m 5 CG patches. 
Clearly, lytic induction did not affect abundance of 
upstream m 5 CG patches in either group of molecules. 
Pooling the two experimental conditions, we computed 
P<0.01 that CG methylation would differ so much by 
chance between clusters i and ii-vi. Hence, the presence 
of m 5 CG patches upstream of RTA was anti-correlated 
with openness of nearby chromatin to GC methylation. 
Furthermore, epigenetic 'fingerprinting' showed that 
changes in cluster abundance on TPA treatment were pri- 
marily due to flux in molecules lacking the upstream CG 
methylation (Supplementary Table S3). This indicated 
that upstream-methylated RTA promoters were not sig- 
nificantly remodeled in response to TPA. Thus, CG 
methylation fingerprinting indicates that different MAPit 
patterns arose from epigenetically distinct subpopulations 
at RTA. These data indicate an epigenetic basis for the 
chromatin structural diversity observed at RTA, 
implying long-term stability. 

Altered chromatin dynamics on constitutively 
latent episomes 

Unlike BCBL1, TIVE-LTC do not reactivate either spon- 
taneously or in response to TPA (32). We therefore 
decided to study chromatin conformation in TIVE-LTC 
as a model for purely latent episomes, which are transcrip- 
tionally silenced except for the LAN A region. For com- 
parison with 94 TPA-untreated LAN A reads from latent 
BCBL1, we cloned and sequenced 107 from TIVE-LTC 
cells. To determine the extent to which TIVE-LTC and 
BCBL1 episomes had chromatin structures in common, 
we pooled all GC methylation plots from the two 
datasets and clustered them hierarchically (Figure 7). 

We divided the dendrogram into clusters of interest as 
before and calculated P values for under- or overrepre- 
sentation of molecules in each cluster from each cell 
type. There was a significant increase in locus-wide pro- 
tection (46-78%) of LAN A in TIVE-LTC compared with 
BCBL1 cells (cluster i, P = 10 -4 ). This was not caused by 
reduced nuclear permeation by M.CviPI, as the control 
locus GAPDH was, if anything, more open in 
TIVE-LTC cells (Supplementary Figure S2C). TIVE- 
LTC cells also produced fewer molecules in which nucleo- 
somes were shifted upstream to cover the latent TSS 
(Figure 7B, cluster ii). In addition, there was a significant 
reduction from 37 to 16% in the number of molecules with 
accessible chromatin over the latent TSS in BCBL1 cells 
(cluster Hi, P = 1 x 10 -4 ). However, CTCF occupancy 
was unaltered between the cell lines, as observed from 
molecules exhibiting accessible DNA around the CTCF 
sites (clusters ii and Hi). Finally, we observed no highly 
accessible molecules in the TIVE-LTC cells (cluster iv). 
Combining data from both the LAN A and RTA loci 
(below) yielded P = 0.01 for the presence of highly open 
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Figure 7. Chromatin structure at KLAR in TIVE-LTC compared with 
BCBL1 cells. (A) Schematic of the locus. The LANA ORF begins left of 
the amplicon. The latent and lytic TSSs are indicated. TATA boxes are 
indicated by triangles with orientations as in Figure 1A. (B) All GC 
methylation plots for BCBL1 and TIVE-LTC (yellow and blue, respect- 
ively, including GCG) were organized by unsupervised hierarchical 
clustering. GC sites are indicated above; plot is to scale with the 
other panel. Dendrogram was split into four fractions of interest. On 
right, proportion of sequence reads from each data set present in each 
cluster, and estimate of expectation value P calculated as in previous 
figures. Column for TIVE-LTC cells does not sum to 100% owing to 
rounding. 



presumably lytic molecules in BCBL1, but not in 
TIVE-LTC, cells. 

We also compared the 114 TPA-untreated RTA reads 
from BCBL1 to 105 from M.CviPI- treated TIVE-LTC 
nuclei, and plotted the percent methylation at each site 
in the amplicon (Figure 8 A). Surprisingly, KSHV 
episomes from TIVE-LTC cells, which except for the 
LANA region are transcriptionally silenced (32), had less 
overall endogenous CG methylation than did those from 
BCBL1 cells at the RTA locus, and lacked the upstream 
peak (Figure 8A, -313 to -284). The GC accessibility 
profile was different as well. TIVE-LTC episomes had a 
different nucleosome organization with less accessible 
chromatin at the RTA promoter (Figure 8A, —200 to 
+ 100 relative to the RTA TSS), but slightly more 
observed downstream (Figure 8 A, between +100 and 
+200 bp from the RTA TSS). Subtracting BCBL1 foot- 
prints from TIVE-LTC footprints (Figure 8D), we found 
differences in frequency at which nucleosomes were foot- 
printed in the RTA promoter (broad black peaks between 
—250 and +100 bp). Increased positioning of the N+l 
nucleosome centered upstream of the RTA TSS in 
TIVE-LTC cells corresponded with an increase in an ac- 
cessible linker downstream, which was occupied by 
sub-nucleosomal footprints of variable size (red peak 
centered at +109, cf. cluster Hi in panel C). Thus, the 
bulk accessibility data suggested that the RTA locus 
was more compact in TIVE-LTC than in BCBL1 cells. 
This compaction of the RTA promoter may account for 
the inability of TIVE-LTC cells to spontaneously 
reactivate. 
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Figure 8. Silent chromatin at RTA promoter in TIVE-LTC cells. 
TPA-untreated BCBL1 MAPit reads from Figure 3 were compared 
with MAPit reads from TIVE-LTC cells. (A) Profile across RTA 
promoter of average endogenous CG (green) and exogenous GC 
(blue) methylation in TIVE-LTC cells, as well as CG (red) and exogen- 
ous GC (black) methylation in BCBL1 cells from Figure 2A for com- 
parison. Note absence of 'linker' H-m 5 CG peak at —300 bp in 
TIVE-LTC cells, despite the presence of an overlapping G-m 5 CH 
peak, on left side of graph. (B) Diagram of RTA locus amplicon and 
inferred nucleosome positions, as in Figure 2, to scale with other 
panels. (C) Clustered GC (including GCG) methylation maps from 
BCBL1 (yellow) and TIVE-LTC (blue) presented as in Figure 2. Five 
clusters of interest are presented. On right, proportion of sequence 
reads from each data set present in each cluster, and estimate of ex- 
pectation value P calculated as in Figure 2. (D) Difference in frequency 
of footprints of two size ranges from 0 to 12 h of TP A calculated as in 
Figure 2. 



DISCUSSION 

We applied single-molecule MAPit footprinting to inter- 
rogate the chromatin structures of viral episomes in 
KSHV-infected cells of lymphoid and endothelial origin, 
which contain ~80 and 5 copies per cell, respectively. 
Three loci that differed with respect to expression were 
analyzed: the highly transcriptionally active LANA 
promoter, the inactive but poised RTA promoter, and 
the moderately active but highly RTA-responsive vIL6 
promoter (58). At all three loci, MAPit revealed that 
KSHV episomes within latently infected PEL cells exist 
in a complex mix of chromatin conformations. Our data 
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suggest that, as evidenced by chromatin architecture, only 
a fraction of episomes participate in latent gene expression 
and in reactivation at any given time. Links between nu- 
cleosome positioning and CG methylation provided novel 
evidence that epigenetic drift restricts ability of virus to 
reactivate lytic transcription. We discuss how this drift 
may affect important steps within the y-herpesvirus life 
cycle, such as long-term maintenance of latency and the 
rate and ability to reactivate lytic replication. 

Heterogeneity in chromatin accessibility 

MAPit was originally developed as a probe of heterogen- 
eity in chromatin remodeling (28). We have now detected 
diverse chromatin states at two human and three viral loci 
by MAPit. These results were reproducible, verified by 
restriction endonuclease digestion and consistent with 
previous single-molecule studies of promoter function 
(12,28,29). 

Heterogeneity in chromatin architecture can occur at 
even constitutively active promoters, as transcription 
factors dissociate, bind, recruit remodelers and so forth. 
However, molecules without accessibility to M.CviPI are 
presumably inaccessible to other soluble factors as well. 
Thus, it is difficult to envision how the transcriptional 
machinery could be recruited to the M.CviPI-inaccessible 
copies of the promoters GAPDH, LAN A and so forth. On 
the other hand, the half-life of this compacted state in live 
cells is unknown. It may or may not in all cases be due to 
long-term silencing. However, as discussed below, at the 
RTA promoter, at least some episomes were in an 
epigenetically distinct state with reduced local accessibility 
to M.CviPI. 

Broad peaks of accessibility to M.CviPI, i.e. 
nucleosome-free regions typically associated with tran- 
scriptionally active or poised promoters, were identified 
near each of the TSSs. In latently infected cells, the 
NFR at the LAN A promoter co-localizes with active 
histone modifications (H3K9/K14ac and H3K4me3), but 
lacks detectable repressive marks (H3K27me3 and 
H3K9me3) (3,4), (R.R, unpublished data). In the 
absence of stable repressive marks, it is surprising that 
the NFR was essentially absent from 40 to 50% of 
episomes in BCBL1 (Figure 2C) and a larger fraction 
(-80%) in TIVE-LTC cells (Figure 7B). However, at 
present, it is not clear if the inaccessible fraction either 
lacked positive marks or, alternatively, possessed 
positive marks and lacked an NFR needed to support 
assembly of the preinitiation complex. 

Mechanism of lytic reactivation 

Spontaneous viral reactivation occurs as a result of leaky 
RTA expression. We analyzed chromatin structures 
during lytic reactivation by both TPA and induction of 
a conditional RTA transgene. Both latent and lytic 
episomes were observed. The latter, comprising ~3% 
(16% in TREx BCBL1-RTA cells with higher basal 
RTA expression) of molecules sequenced, were distin- 
guished by their high chromatin accessibility and were 
absent in TIVE-LTC, in which KSHV cannot reactivate. 
We hypothesize that these highly GC-methylated reads 



derive from products of lytic replication and therefore 
largely lack nucleosomes. Although certain related 
viruses have been reported to incorporate nucleosomes 
during lytic replication (59,60), our results suggest that 
lytically replicated KSHV DNA is largely protein free. 

Some of the other clusters might correspond to chroma- 
tin conformations present earlier in the lytic cycle. Indeed, 
MAPit revealed open chromatin at the RTA promoter in 
latently infected BCBL1 cells. Nucleosomes N— 1 and 
N + 1 were both occasionally displaced (Figure 3C, 
clusters iv and vi; Figure 5, clusters Hi and iv). To link 
any of these observed conformations with transcription, 
RTA was induced either by TPA or by RTA 
overexpression and resulting changes in chromatin states 
were determined. Under the two conditions, different 
classes of intermediate-open promoter molecules were 
observed to change in abundance at RTA. Those that 
decreased might have been poised for reactivation, 
whereas those that increased may represent stages in 
cycles of active transcription. 

In both induction experiments, a pool of molecules with 
a partly accessible RTA promoter was depleted in 
response to lytic reactivation (cluster ii in Figures 3 and 
5). Promoters with this chromatin architecture were there- 
fore probably primed or poised to respond to activators of 
RTA. Such molecules comprised 10-20% of the total 
population. As spontaneous reactivation is rare, it 
follows that many of the intermediates we observed 
required some additional event to progress to reactivation. 
Interestingly, the promoters for RTA and vIL6, both 
targets of RTA, have been shown to have trimethylation 
of both H3K4 and H3K27 (3,4) (R.R., unpublished data), 
the bivalent mark associated with developmentally poised 
genes (61). Trimethylation of H3K27, which is associated 
with transcriptional repression, is lost quickly on RTA 
(and vIL6) promoter activation (3,4). Loss of this mark 
may allow episomes with an open RTA promoter to 
progress to lytic reactivation. Alternatively, the mark 
may be ancillary. In either case, our results show that re- 
activation as monitored by increased chromatin accessibil- 
ity is limited to a subset of episomes, even when induced 
by overexpression of RTA. Increased accessibility in a 
limited subset of KSHV copies is consistent with the 
finding that histone H3 binding was not depleted at the 
RTA and vIL6 promoters under identical conditions of 
reactivation (4). An uneven distribution of episomes 
potent for reactivation between cells might contribute to 
the low rate of spontaneous reactivation, as seen both in 
BCBL1 cells and in endothelial cells within KS tumors 
(62). 

Cytosine methylation on linker DNA in KSHV 

Bands of CG methylation separated by a nucleosome 
particle length of DNA occurred across nearly all 
episomes in the population downstream of the three 
CTCF sites in the LAN A gene (Figure IB and C). It was 
unlikely coincidence that GC methylation peaked in the 
same locations (our unpublished observations). CTCF is 
known to organize arrays of nucleosomes (63). We there- 
fore propose that nucleosomes downstream of CTCF in 
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the LAN A ORF are highly positioned. Another potential 
example would be the methylation patch just upstream of 
the N — 1 nucleosome in the RTA promoter. A corollary 
conclusion is that DNMTs methylate linker, but not nu- 
cleosomal DNA, in KSHV. This conclusion contradicts a 
report of preferential methylation of nucleosomal DNA 
within Arabidopsis (64). Curiously, only linkers between 
nucleosomes were methylated. No patch of CG methyla- 
tion appeared in an open TSS or between CTCF sites. Nor 
were all linkers sites of CG methylation: although the 5' 
end of RTA ORF was occupied by nucleosomes at varying 
translational positions, little CG methylation occurred 
there. Finally, TIVE-LTC cells, which appeared to have 
the highest nucleosome density (i.e. least accessibility to 
M.CviPI) did not develop patches of CG methylation. 
Evidently, although CG methylation was restricted to 
linker DNA, it had other unknown determinants as well, 
post-translational modifications of histones perhaps. 
Indeed, several DNMTs interact with histone tails 
[reviewed in (65)]. 

Epigenetic diversity in KSHV 

The discovery of partial CG methylation upstream of 
RTA provided a means to test whether the diverse chro- 
matin states that we observed in BCBL1 episomes were 
epigenetic. Giinther and Grundoff (3) showed that repres- 
sion of RTA in latency was independent of DNA methy- 
lation in PEL and SLK cells. We have now observed the 
same in TIVE-LTC cells (Figure 8A). Thus, we do not 
conclude that methylation of 2-3 sites upstream of the 
RTA promoter affects transcription of the locus. 
However, we hypothesize that, as DNA methylation is 
epigenetic, the sites can be used to 'fingerprint' distinct 
epigenetic lineages within the BCBL1 viral population. 

At the RTA promoter, the degree of upstream CG 
methylation correlated with the chromatin conformation 
identified by MAPit. Whereas 31% of episomes in the 
most compact chromatin state (Figure 3C, cluster i) had 
the upstream patch of CG methylation, only 15% of mol- 
ecules in the intermediate-open conformations (clusters 
ii-vi) did (Figure 6B). This uneven distribution suggested 
that there were at least two epigenetic states within 
latently infected BCBL1 cells. Furthermore, these states 
likely differed in their propensity to reactivate lytic expres- 
sion, as cluster i molecules were unchanged by addition 
of TP A. 

Because the TREx BCBL1-RTA cells, cloned from 
BCBL1, lacked the upstream patch of CG methylation, 
we could not test whether resistance to direct RTA induc- 
tion was epigenetic. However, it was interesting to note 
that -40% of TREx BCBL1-RTA endogenous RTA pro- 
moters were in a compact state, and the proportion in this 
state was not significantly affected by induction of RTA in 
trans. 

Several latent states — LI, L2 and L3 — have been 
defined in a close relative to KSHV, the Epstein-Barr 
virus (EBV). These states are distinguished by expression 
of different subsets of the latent genes. In EBV, CTCF 
restricts spread of DNA methylation (66). Rare events in 
which CTCF function is compromised could allow EBV to 



drift between the three latent states. Likewise, it has 
been proposed that KSHV might have multiple levels of 
latency (67). Our data suggest three distinct latent states 
(Figure 9). The deepest level of latency, L0, was seen in 
TIVE-LTC, which had completely lost their ability to re- 
activate. Interestingly, TIVE-LTC cells had little 
trimethylation of H3K4 at the RTA promoter (R.R., un- 
published data), lacking the bivalent state previously 
identified there (3,4). The RTA promoter was occasionally 
accessible in TIVE-LTC, but never as much as the most 
open RTA promoters in BCBL1. Either lack of H3K4 
trimethylation, presence of H3K27 trimethylation or 
some other local factor may have prevented the degree 
of promoter chromatin disassembly needed for reactiva- 
tion of the gene. 

The next level of latency, LI, occurring in BCBL1 
episomes, was associated with CG methylation upstream 
of the RTA locus and tended to have a closed chromatin 
structure. The third level of latency, L2, had the most 




Figure 9. Model for three distinct epigenetic states of KSHV defined 
by MAPit at RTA promoter. (A) In the most open latent state, Latent 2 
(L2), unfolding of higher-order chromatin structure transiently exposes 
the TSS and an upstream linker. With induced or spontaneous reacti- 
vation, transcription factors (polygon labeled TF) replace repressors 
(not shown) near the TSS. This leads to lytic replication, producing 
nucleosome-free DNA (far right). (B) The exposed linker in L2 can be 
CG methylated (triangles) by endogenous DNMTs. This methylation 
correlates with a shift to another epigenetic state, Latent 1 (LI), in 
which chromatin tends to be more closed. LI virus responds to 
phorbol ester induction of lytic transcription weakly or not at all. (C) 
A third state, Latent 0 (L0), occurred in TIVE-LTC cells. Although in 
these cells, different stages of chromatin folding were observed, neither 
DNMTs nor transcription factors were able to access internucleosomal 
linker DNA, perhaps due to the presence (or absence) of specific 
histone modifications. L0 cannot be induced to reactivate by phorbol 
ester. 
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open structure at RTA, with the promoter poised for 
reactivation. We have not yet determined the range of 
chromatin conformations accessed by each epigenetic 
state, only that the LI state is more likely to have a 
closed chromatin conformation at RTA. It seems likely 
that the same episomes have compact chromatin at the 
LAN A and vIL6 promoters, given that the proportions 
in the closed state were the same at all three loci. 

We hypothesize that infrequent methylation of RTA 
promoter DNA accompanied chromatin compaction and 
the transition from L2 to LI. Our findings suggest that the 
gradual methylation of episomes following infection (3) is 
linked to phenotypic changes. DNA methylation, at least 
at the RTA promoter, was associated with loss of both 
chromatin plasticity (Supplementary Table S3) and acces- 
sibility to soluble factors (e.g. M.CviPI). Progressive 
silencing of KSHV episomes would serve as a natural 
defense for the host cell. It could eventually lead to repres- 
sion of LAN A and loss of infection, a phenomenon 
observed in vitro (68,69). These results support a model 
originally proposed by Grundhoff and Ganem (2004) (68) 
in which lytic reactivation, albeit infrequent, is required 
for life-long persistence of KSHV infection. Recently, 
Kobiler et al. (70), using the Brainbow cassette to trace 
the lineage of viral genomes during lytic replication, 
demonstrated that only 6-10 genomes within each cell 
(i.e. <1%) gave rise to progeny virus. Our observations 
suggest that a similar restriction may apply to KSHV re- 
activation from latency. 
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